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Abstract. We consider the problem of finding small prime gaps in various sets C C N. Following 
the work of Goldston-Pintz-Yildmm, we will consider collections of natural numbers that are well- 
controlled in arithmetic progressions. Letting q n denote the n-th prime in C, we will establish 
that for some "small" constant e > 0, the set {qn\qn+i — Qn < elogn} constitutes a positive 
proportion of all prime numbers. Specific examples, such as the case where C is an arithmetic 
progression have already been studied and so the purpose of this paper is to present results for 
general classes of sets. 

1. Introduction and framework 

One of the most famous unsolved problems in Number Theory is the so-called Twin Prime Con- 
jecture, which posits the existence of infinitely many pairs of primes for which p — p' = 2. 
Throughout the past century a great amount of work has been done with regards to this conjecture 
and we refer the reader to [3] and [13] for some historical background on the subject. In this article 
we will build upon the methods developed in recent years by Goldston, Pintz and Yildirim. Letting 
p n denote the n-th prime number, it was shown in [5] that 

,. . , Pn+l ~ Pn n 
limmf 71 \T/2+£ = 

n-+oo (10gp„)U^+ £ 

for any e > and, regarding the frequency of small prime gaps, it was demonstrated in [7] that 

N 

I {Pn < N Pn+l ~Pn < V log N} | > C- - 

log A* 

for some constant C depending on 77 and any natural number N . In this paper we will investigate 
which conditions ensure that a set CcN will also have many small prime gaps, in the latter sense. 

Definition 1.1. Let q n denote the n-th prime number in C and for r/ > 0, write 

Tt C (N; n) := I {q n < N\ q n+1 - q n < nlogN} \. 

We say that C frequently contains n-small prime gaps if there exists a constant C , depending on r], 
such that 

log iv 

for all positive integers N > 1. 

From a probabilistic point of view, let us first show that a random subset of the primes, with positive 
relative density p, frequently contains 77-small prime gaps (for any rj). To avoid issues of indepen- 
dence, we will work with the sets Vc{N; rj) W := {p n < N\p n ,p n+ ± € C,n = i mod 2 and p n +\ — p n < rylogiV} 
for i = 1,2. For fixed N, denote tt^(N; rj) for the cardinality of the largest of these two sets. 
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Lemma 1.2. Let B be a random subset of the primes defined by letting the independent events 
p € B occur with probability ¥(p 6 B) = p. Writing X(N, rj) := p 2 ir^(N; rj) > p 2 ir^(N\ rf)/2 one has 
that 

y »7 fc>i Af>/c ^ J y 

where rj is made to run over the fractions (l/n)„ e R. 

Proof. For fixed AT and 77, write M := n^(N;rj) and let X ~ B(M, \(N,rj)) denote a Binomial 
random variable ( i.e. a sum of M independent Bernoulli trials, each of which yields success with 
probability p 2 ). Observe that for any natural number k one has that 

p(n* B (N- lV ) ( M ) ( P 2 y(i p 2 ) m - j = nx < k). 

It now follows after a simple application of Chernoff's inequality (see for instance ;16j Theorem 1.8] 
) that 

P(X < A/2) < P(\X - E(X)\ > A/2) < e- x 
and to conclude the proof of {1}, we need only invoke the Borel-Cantelli lemma. □ 

Despite these heuristics one can piece together large subsets of the primes which have only large 
prime gaps. From [71 Theorem 3] we get the existence of a constant C > such that 

I {p n < N\ p n+1 -p n <h}\< Cmin (r^jj, A tt(JV) 

for any pair of positive integers N and h. In particular, any hope of obtaining a result of the form 
"If COT has positive relative density in the primes then C has small prime gaps" is dashed. Indeed, 
taking h = log N/2C in the statement just above, we gather that the collection of primes p n for 
which Pn+i — Pn > l/2Clogp n constitutes a positive proportion of all prime numbers. It thus 
becomes apparent that some kind of structure must be imposed on C if we wish to get primes in 
short intervals. We will explore two scenarios in which we are able to control the interaction of C 
with arithmetic progressions. In each setting we provide a somewhat general result and then give 
some examples of sets obeying the desired properties. The two main examples are 

Theorem 1.3 (Bohr sets have small prime gaps). Let q(x) = ~}2!j=a a j x ^ P\x] and suppose all 
coefficients ay are Diophantine. Then for any d £ (0, 1) and rj > 0, the Bohr set 

A;={neN\ {fl(n)} e [0,d] } 

frequently contains rj-small prime gaps. Here, {x} denotes the fractional part of a real number x. 

Theorem 1.4 (Shifted sets of square- free integers have small prime gaps). Let n > and a 6 N 
be arbitrary. Then the set of shifted square-free integers 

B := {n G N | n + a is square-free} 

frequently contains rj-small prime gaps. 

Remark 1.5. In Theorem \1.3\ one expects that the statement remains true as long as the coefficient 
old is irrational, however with our current methods this seems out of reach. We will discuss this in 
greater detail in section 6. 
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The case where C is an arithmetic progression has been studied by several authors. These results, 
which will be stated in section 6, differ from our own in the sense that we obtain small prime gaps 
frequently, as opposed to "infinitely often" . 

Notation Wc introduce some standard notation that will be used throughout the paper. For 
functions / and g we will use the symbols / <C g and / = 0(g) interchangeably to express Landau's 
big O symbol. A subscript of the form <C, means the implied constant may depend on the quantity 
1). The statement / ~ g means / and g are asymptotically equivalent, i.e., lim. r _j. 00 f(x)/g(x) = 1. 
For two positive integers /, m we write I x m when 1/2 < m < 21. We reserve the letter \i for the 
Mobius function and A for the von Mangoldt function. In place of A we will sometimes use the 
funcion 

, . _ J log n if n is prime 
[0 otherwise. 



2. The main results 

2.1. Type A sets. We first consider sets A C Z which exhibit an even distribution among 
arithmetic progressions of any given modulus. Fix a natural number k. We say that a fc-tuple 
W = (hi,...,hk) has height h if each member is bounded in size by h. A set A will be of type A& if 
it exhibits the following properties. 

(a) (Estimates for progressions in A) 

There exist constants c, d > such that, for all positive integers N, 



l = c k -+TZ A (N,a,q) 

n<N,n+HCA 9 
n=a mod q 

and for any e > 0, the remainder term satisfies the bound 

(2) V max \K A (N, a, q)\ « £ N 1 ^' 

q<Q 

in the range Q < ATV2-£ and uni f orm i y 

over all fc-tuples % of height h < log A. 

(b) (Estimates for primes in „4) 
We have that 



(3) 



J2 A(n)=c k N + o( 



N 



(log NY 



n<N,n+HCA 

for any C > 0. In addition, for every pair A, B > 0, we have the estimate 



(4) 



E E 

q<Q' x mod q 
X^Xo 



E A (n)x(n) 



n<N 
n+HCA 



N 



{\ogN) B 



uniformly in the range Q' < (log N) A . The index \ mod q runs over Dirichlet characters of 
modulus q and \o denotes the trivial character. 
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(c) (A bilinear form estimate for ^4) 

There exists a constant C = C(A) such that 



(5) ^-W) £ 

q<Q ^ yHJ x modg 



^2 ^2a m bix(ml) 



m<M 1<L 
ml+UcA 



« (M + Q 2 ) 1/2 (L + Q 2 )V2 (i og MI) C ||a|| 2 ||6|| 



for any pair of integers M, L. The sum ^* indicates we are summing over primitive 
characters and ||.|| 2 denotes the £ 2 norm of a sequence. 
In this setting we will prove the following result. 

Theorem 2.1. Any set A C N of type Afc frequently contains rfk-small prime gaps, for some 
r\k *C 1/cfc. Here, c is the constant appearing in condition (a). 

By way of comparison, consider a type A\ set A. The asymptotic tells us that A frequently 
contains rj small prime gaps whenever r\ > 1/c. In other words, Theorem 12.11 is relevant for large 
values of k. With some additional structure on A, however, we can obtain an improved result for 
type Ax sets. 

Definition 2.2. Let Q be a finite set of primes and write V(G) — Yi p eg P- We say that a set ^cN 
is locally well distributed if it satisfies estimates of the type 

}2 ly(n) = \-+Ky{N,a,q) 

n<N 1 
n=a mod q 

for some A > and there exists a constant e > such that 

. , N 

L nrax|^(7V,a,< Z )|« ^— ^ 

/or Q < iV e and A > arbitrary. 

Definition 2.3. Lei .4 6e a subset of the natural numbers for which we can write A' + y C A, 
where A is a type Ai set and 3^ is locally well distributed. Then we say A = (A;A,y) is of type 
A'. 

Theorem 12. If . Let A be a type A' set. Then for any rj > 0, A frequently contains rj-small prime 
gaps. 

We will apply these theorems to the case where C is a Bohr set. In section 6 we will prove that, 
after a somewhat careful selection of fc-tuples H, conditions (a) — (c) hold. From this, Theorem II .31 
will follow. For linear Bohr sets (i.e. when D = 1) we will make use of Theorem l2.1[ . It should also 
be noted that, trivially, N is of type Ak for any fc g N, so that we recover the work of Goldston, 
Pintz and Yildirim. 

2.2. Type B sets. We will call Bc2a type B set if it has the form 

B = P| [J {n G N| n = a mod m} , 
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where COT is a collection of pairwise co-prime integers and associated to each m £ 9Jt is a selection 
of residue classes 9Jt m C Z m . We will write N m for the complement of 9Jl m and assume that 

' Lp(m) x K 
mean ^ v y 

m>a: 

for some k > and any a; > 1. In this particular setting we will show that 

Theorem 2.4. Let B C N be a type B set satisfying the conditions listed above and let 7/ > be 
arbitrary. Then B frequently contains rj-small prime gaps. 

Remark 2.5. Consider a type B set to which there are coupled infinitely many m 6 To deal 
with sums of the form 5Z p ls(p), we are essentially counting primes in arithmetic progressions. 
However, since 9PX contains infinitely many moduli and we are not able to process a large quantity 
of remainder terms, venturing down this avenue would pose a problem. Instead we shall work with 
"approximate type B sets B(z) ", which are collections of the form 

(6) B(z):= P| (J {n£N\ n = a mod m} . 

mean a&xn m 

At this point we note that Theorem 11.41 follows easily from Theorem 12.41 but we will present the 
examples in section 6. 

An outline of the strategy 

To prove the main Theorems we will largely follow the framework set up in [4], [7] and [13] , save 
some minor modifications. Let "H C N with \H\ = k and for any such set, define the polynomial 

Pu(n) := l[(n + h). 
hen 

Also, let I < k and write V{z) :— \\ v<z p. From now on C will denote either a type A& set A or 
an approximate type B set B(z). In the latter case write T for the collection of primes dividing 
:= rimeOT m<z m an d f° r tyP e Afc sets we take T to be empty. When possible, we will treat 
both types of sets in a unified manner. 

As in [4] , the Theorems are proven by evaluating weighted sums of the form 



(7) E 



/ \ 

£ 6(n + h )-\og(3N) 



n=l 
(P H (n),V(R S ))-- 



hn<h 



£A|(n;H,Z), 



\n+h eC ) 

where A is a natural number, h := 77 log N and Ti runs over all fc-tuples of height h. It should be 
noted that the expression ([7]) can only be positive if there exists an interval of length h in which 
C has at least two primes. In addition, (0 can only be "large" if C has "many" small prime gaps. 
Next we observe that, when splitting the above expression into two parts, the condition n + ho 6 C 
falls only on the first sum. Intuitively, this indicates that an extra factor (the density of C) will 
appear when dealing with this first sum but not for the second sum. To overcome this issue we will 
consider carefully selected fc-tuples H. 

The weight function An denotes a truncated version of the von Mangoldt function 
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1 / i? X k+l 

V ^ d|P„(n) V 

d<Ji,(<i r F)=l 

where R = R(N) is a parameter that will be chosen later. This function should be compared to 

1 / i? x k+l 

W") : =(fcT7)lE^) (log 7 

which counts, in a weighted fashion, those values of n having at most k + I prime factors. 
After establishing a Bombieri- Vinogradov type result in section [31 we will prove two crucial as- 
ymptotic estimates in section SI namely Propositions 12.81 and 12. 91 which are the analogues of [7J 
Propositions 1 and 2] . In section [S] these will be used to obtain an asymptotic formula for (J7J . On 
the other hand we will demonstrate a connection between the quantity (|7J and Trc(N;r/), thereby 
proving our main results. 

Definition 2.6. Let C be a type set A or an approximate type B set B(z). We introduce the 
singular series 

£\ pJ V p J£\ p 

where Q p (7i) denotes the number of distinct residue classes occupied by % inX p . 

Definition 2.7 (Selecting fe-tuples). Fix k £ N and let H = {hi, hk} be a k-tuple. We will write 
H € f)fe if> f or each 1 < i < k, we have hi = mod V(k). 

Proposition 2.8. Fix k G N and letC be a type Aj. set or an approximate type B setB(z). Suppose 
N Cl <C R^+s (log A)~ C2 for some constants Ci depending on k and I x \fk. Furthermore, assume 
8 > is sufficiently small compared to fc~ 3 / 2 , H € f)fc+i is admissible and h <C logi? with h — > oo 
as N — > oo. Then we have 



Y Al(n;H,l) = 7 (C,H) (l + 0(5 2 fc 3 )) Npfj ^£^(l og R)^' 

n<N,n+HCC ' 



When C = B(z), the superscript jj indicates we are selecting those values of n satisfying n + H C 
l~l 9Jt m /or aZZ m £ DJl, m < z. The constant factor is given by 

7 (c,7o= n f^r) n ^ * c = b ^ 



m 

m&xn x ' mean 

m<fc+l fe-(-l<m<js 



andjiCjH) = c k ifC = A. The symbol T m (H) represents the cardinality of the setD k =1 ((Z^ n A4 m ) - 

Proposition 2.9. Let e > be fixed. Given the same conditions as in Proposition 2, we have for 
N C1 < R 1 ^ (log A)~ C2 and H Q := U U {h } £ f) fc+1 admissible 



N 



£ c^+MA^n; «, = 7(C, «°) (l + O(o 2 fc 3 )) A ( + (log *) fc+M , 

(P M (n),P(fl 5 ))=l 
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where uiq = 1 if ho € H and ttiq — if not. 

3. A Bombieri- Vinogradov theorem for type A k sets 

In this section we make the necessary changes to the proof of the Bombieri- Vinogradov theorem to 
ensure an identity of the form 

, r k AT 

J2 e(m) = ^f s + E(N;q,a), 

m=a mod q 

with the error term obeying the bound 

N 



(8) m a ax \E(.N;q,a)\4ZA* 



(\ogN) A 

g<JVl/2-e (o,?) = l 6 > 

for any e > 0. In the next section we will need this result to demonstrate Propositions 12.81 and 12.91 
for type A sets. 

Remark 3.1. To avoid any additional assumptions on type Ak sets we wish to forgo the use of the 
Polya - Vinogradov theorem. This result is used in the proof of the Bombieri- Vinogradov theorem 
and if one were to follow the proof word-for-word in our current setting, one would require a bound 
of the form 

n<N ,n+H<ZA 

for any non-trivial character x mod q with 1 < q < N 1 / 2 ~ e . For this reason, we will rearrange the 
argument in [TJ Chapter 28 ] to better suit our purposes. 

Let us begin by defining the sums 

^a{N,x)-= Y A(m)x(m) and ip A (N;q,a) := Y A ( m ) = "TT E E x( a )M m )x(m) 

l<m<N l<m<N ^'xmodgl<ro<JV 

m+HCA m+HCA m+HCA 

m=a mod q 

Removing the expected main term from ip A (N;q,a), we may write 

Js: 



r K AT 1 

^A{N-,q,a)-- [ - = — x{aW A {N, X ) 



where 



././ (M v ^ = J 4a(N, X) when x Xo 

VaV^X) I iP A (N, Xo )~c k N when X = Xo- 

For small moduli, e.g. q < (\ogN) A , we can estimate the sum 

E^y E i^wx)i 

q<Q ^ yHJ X mod q 

immediately, using the conditions imposed in (b). The resulting contribution is bounded by the 

rhs of m. 
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An estimate for large moduli 

For large values of q < Q we will use the estimate 

Proposition 3.2. For any e > and A > 0, there exists a constant C > such that 



0) E 



Q 



q<Q fiS) («.fl)=i 
in the range Q < N 1 / 2 



$3 iwa^x) 

X mod q 



< 



^2^1/2 + N 5/6 Q + N \ (log q^C 



QN 



(\0g Ny 



Proof. We will rework the argument laid out in [TJ chapter 28] in conjuction with the inequality®. 
Once © is established, the proof of © is straightforward. 

With this in mind, we decompose our sum, as described in |TJ p. 139], to obtain for x 7^ Xo 

W x ) ■= E AMxM = E A M/M = s 1 + s 2 + s 3 + s 4 



m<N 
m+HCA 



2<N 



where 



Si = E A(n)/(n) « U, S 2 = ]T £ M (d)A(m) £ /(ri), 



n<U 



t<UV \ md=t 

\m<U.d<V 



r<N/t 



Sz=^T[j,(d) V f{dh)\ogh and 5 4 = - V A(m) ^ 



'3 — ^ fa "7 / „ 

d<y h<N/d 



U<m<N/V V<k<N/m 



( \ 

n(d) 

cl\k, 

\d<V I 



E 

d\k 
,d<V 



f(mk). 



From now on we will take U = V = iV e for some constant < e < 1 . To bound the sum 52 we first 
restrict the variable t to the range [1, U] and then to the range [U, UV]. Denote the resulting sums 
as S 2 = S' 2 + S 2 . The sums S' 2 and 53 can be combined to create some degree of cancellation by 
noting that 



£ x(a)(S' 2 + S 3 ) = J2 H"* A )W E E x(a)/(rt)+ M (t) E E x{a)f{rt)\og, 



X mod q 



t<U 



r<N/t X mod q 



r<N/t X mod g 



t<[7 



M(t)logi J] f{rt)+ii{t) /( r< ) lo S'' 



V 



r<N/t 
rt=a mod q 



r<N/t 
rt=a mod q 



*>(«)!>(*) E /mm^) 

t<t/ r<JV/t 

rt=a mod q 



K{x,tq) 



dx 



(*,<?) = ! 
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where * denotes a Dirichlet convolution and the last identity follows from summation by parts. For 
the sum ^ x mo d q (S' 2 + S3) we simply subtract the X = Xo term in the first line. The additional 

x=£xo 

term will be of the same form as the last line, just above, and can be dealt with in the same way. 
This last expression may be recast as 

/ \ 



(10) c£fo>5>(0 J2 ^(N\o g N-N) + 



l\q 



t<U 
£=0 mod I 



V 



f y{q) K{x,tq) — 

Jl t^TT X 



t<U 
(t,q) = l 



To estimate (fT0|) we require the following lemma. 



Lemma 3.3. Let $ c (x) := exp((log x) c ), then for any positive, squarefree integer I < <&iu(x), one 
has the bound 



t<x 
t=0 mod I 



i> 1/4 (x) ' 



where to(l) denotes the number of primes dividing I. 

Proof. The bound is easily demonstrated by induction on lo(1). To be precise, we will show that 

l/4\ 



(11) 



t=0 mod I 



« (2w(0) w(i) exp - ( lo 



(*i/4(a;))*-W 



When oj(1) = and hence I — 1, this estimate is obtained as a consequence of the bound 
ISn<a;M n )l ^ x/$i/a(x) (see for instance [TTJ Chapter II. 5]) followed by partial summation. 
Let us now suppose that (fTTj) has been established whenever w(Z') < r and let Z be such that 
uj(1) = r. Furthermore, given j < r, let lj denote a generic divisor of / having j prime factors. Then 
we have 



\ - m(*) = MO \ - M*) = MO 
^ t 1 ^ t 1 

t<x/l 

(t,i)=i 



£=0 mod i 



EM) 

j<r-l 



MO t^H + O 



t<x/l 
t~Q mod / 



■ E 

j<r-l 



M*) + M*) 

t<x/i ' t<x/l 

t=0 mod i=0 mod / 



MO 



t~0 mod Z 



>r— 1 



o 



-r exp 



(r — 1) J exp 



log 



- log 



1r-2 



1/41 



1/41 



Repeating this process K times yields 



Y M^> 



t<x 
t=0 mod I 



« I t + 2 r - 1 -ar w E^ ex p 



m<K 



- 10 



1/4' 



which gives the desired estimate after selecting the smallest value K for which l K > <J> 1 / 4 (x). □ 
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In order to estimate the first sum in ([TU]) we apply the previous lemma for values of I < $ 1 / 8 (7V) 
(combined with the fact that w(7) <§; \ogl) and use a trivial bound otherwise. Together with © it 
follows that 



y — 



E ^2 + ^3 



X mod q 



N 



(log N) A 



for any A > 0. To estimate S4 and S'2 we will make use of the bilinear form inequality ([5]). Since, 
currently, we are considering sums which run over complete sets of characters for each modulus q, 
these must first be converted to sums involving only primitive characters. 
With this in mind, let \ niod q be generated by the primitive character \i mod q' and write 



a rn = A(m), bk = E v{d)- 

d\k 
d<V 



First observe that 



E 



h [(Xi - x)( mk )} = E a>mhh{mk) - E a m b k fi(mk). 



U<m,k<N/U U<m,k<N/U U<m,k<N/U 

mk<N,mk+H<ZA mk<N mk<N,(mk,q) = 



With regards to 5 4 , it follows that the contribution made by the character xi to the LHS of © 
does not exceed 



E 

<7<Q 
g=0 mod q 



E a m b k fi(mk) 

U<m,k<N/U 
mk<N \{mk,q) — l 



E 

q=0 mod q 



l\q U<m,k<N/U 
rnk<N ,l\rnk 



a m b k fi{mk) 



^ ^ ip(o) 5 

q—0 mod q' 



£ a m b k fi{mk) 

U<7n,k<N/U 
mk<N \l\mk 



«E E 

1<Q q<Q 

q=0 mod q' I 



U<m,k<N/U 
mk<N.l\mk 



a m bkfi(mk) 



and hence we get that 



E 

9<<3 
q=0 mod q' 



E 



a m b k fi{mk) 



U<m,k<N/U 
rnk<N,(rnk 1 q) = l 



«E 



log TV 1 

v(0 W) 



U<m,k<N/U 
mk<N ,l\rnk 



a m b k fi(mk) 



Summing over all primitive characters we conclude that S4 accounts for a total bound of 
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\ - logiV \ - 1 \ - 

4 ' k *® h *w x 



x - logTV x - 1 X - 



^ a m b k fi(mk) 



U<m,k<N/U 
rnk<N ,l\rnk 



a m bkh{mk) ^ l dli i(m)ld 2 ,i(A;) 

U<m.k<N/U (d 1 ,d 2 )eP(l) 
mk<N 



«E 



logiV 



E E v ( q >\ E 



^ amld!,^"!) b k l d2 .i(k)f 1 (mk) 



U<m,k<N/U 
mk<N 



Here, the function Id j(?n) indicates those integers m for which (Z, to) = dandP(7) = {(d 1 ,d 2 ) C N 2 | d x \l, d 2 \l and Z|did 2 }. 
The three inner- most sums can now be dealt with as in [I], except that the estimate ([5]) takes on 
the role of the large sieve. The sum S' 2 ' may be treated as Sa to obtain 

QN 



( 12 ) E -TT E (l 5 al + W4) « ( Q'N 1 ' 2 + ^ + QNUV'V 1 '* + N ) (logiV)^ 
q<Q lf[ - q ' x 



Combining all of the above, we get 

E E ™* liM*. x)\ « (q 2 ^ 2 + n + §^ + uqnW + Q^u) (log QUNf^ mA 



QN 

since U = V. In the range TV 1 / 3 < Q < N 1 ' 2 we choose £/ = N 2 / 3 /Q. In this case it follows easily 
that all terms involving U are bounded by 

Q 2 N 1/2 . 

When Q < TV 1 / 3 we select U = N 1 / 3 . For such values of Q we obtain the bound QN 5 / 6 . This 
concludes the proof of @. To finish the proof of {5} we estimate the sum 



E 



1 



x(*)tPa(n,x) 

X mod <j 



(logAf)^< 9 <Q 

by decomposing the range of q into dyadic cubes and applying ([9]) to each of the resulting sums. 

4. Obtaining the asymptotics 

Given a fc-tuple T~L — {hi, hk}, denote ft(p) for the set of all distinct residue classes of the form 
—hi mod p with hi £ %. When no confusion is possible we will also write fl(p) to denote the size of 
the set. To express the condition p\Pu(n) we write n e f2(p) and extend this definition to squarefree 
integers i.e. d\Pu{n) < ^=> n G il(d). Furthermore, let q — R 13 denote a prime with < f3 < S. In 
order to prove Proposition 12.81 it is enough to demonstrate that 



n<N 
n+HCC 



2A e c (u) 



{k + 2l) 



(logi?) 



fc+22 
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and 

"«<» - cc ^<"^.') « 7^ v <?)|r§ (bgR) ' +3 ' 

«I-Pw(«) 

To complete the argument one then sums the latter bound over all primes q < R 6 . 
The proof of these estimates runs along the same lines as Proposition 1] except for some details 
which we will now point out. When C is a type set, the necessary modifications are very 
straightforward ( except that (JS} assumes the role of the Bombieri- Vinogradov Theorem) so we will 
focus on approximate type B sets. 

Given an approximate type B set B{z), let DJl(z) = Y[q£9Jt 9 an( i take J- to be the set of all primes 

q<z 

dividing 9Jt(z). The superscript jj, featuring in the definition of W\, indicates we are selecting those 
values of n satisfying n + % C n Wl m for all m £ 9Jl 7 m < z. We will only consider tuples 
W = {ho, hi, hk} 6 f)fc+i- Recall that fyk+i consists of tuples satisfying the following condition: 
For i = 0, k, we have hi = mod V(k + 1). 

We will treat Wi(B(z)) and W q (B(z)) simultaneously. Expanding W q (B(z)), we are first led to the 
sum 

£ 1 = ^B^N ^f'f + O ( I] Mm V([du *, q])) ■ 

n<N,n+HcB(z) ^ X ' \rn<z J 

ne n([d 1: d 2 ,q]) 

Here we used the important fact that n + hi e 971 PlZ^ for all i to ensure that the constant r y(B(zj) 
appears. It then follows that 

where the superscript * indicates that {d\,F) = 1 and (d,2,J-) = 1. In the event that g|M(z), 
clearly vanishes and hence we may assume q \ JF . In estimating W q (B(z)) we have incurred an 
error no greater than 

b b 
\Err\^H<m m ^(d)(logi?) 2fe+2 '« z (logi?) 2fc+2/ ]T d 3fe («z(d)) « z i? 2+5+E , 

m<Z d<R 2 + 5 d<R 2 + s 

where sq(d) = lipid? 5 denotes the squarefree part of d, d m (l) represents the number of ways in 
which / can be written as a product of m natural numbers, and e > is arbitrary. Returning to 
the main term in W q (B(z)), we can express the logarithmic factors by means of the discontinuous 
integral 

I f jf_ fO if < £ < 1 

2^y (c)S fe+i i ^(io g ^) fc if x>i 

where c > is arbitrary and (c) denotes the vertical line through c in the complex plane. In doing 
so we find an expression of the form 

(13) W q (B(z)) = 7 (B(*))JV7,(B(z)) + 0(N/(\ogN) B ). 

Here, the integral I q is 
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with 



s \A /i(a 1 )^(a 2 )^(a 12 ) 2 Sl(ai)^(Qi)»(Qi2) 

J{Sl,S 2 )— l+Si 1+S 2 l+Si+Sj 

til tin Lil O 

ai,a2,ai2 ± z ±z 

n(p) n(p) n( P ) 



n (i- 

p0-FU{g} 



pl+Sl pl + S2 pl + Sl+S 2 



Rq(si,s 2 ) :- 



and 

i? Sl+S2 when g = 1 

R st+s 2 _ (R/ q yiRsz _ R s i(R/ q y2 + (R/ q yi+sz when g > 1. 

The superscript ' appearing in the definition of / (si, S2) means we are summing over variables that 
are pair- wise relatively prime. One last manipulation yields 



k 



t 1 f f ri \( C(si + s 2 + l) V R q {sus 2 ) j 

/ -(^y (1) y (1) G ^ s2) (c( S1+ i)c( S2+ i)J (sis2f+ i + t 

where (1) denotes the vertical line through 1 in the complex plane and 

When q = 1 we can apply [4j Lemma 3] since, in our current situation, the product G q {si, s 2 ) differs 
only from its original counterpart by a factor 

n A_nfc)(J_ + J_ 1 

11 ^ p \p Sl p S2 p s l+ s 2 

which is analytic and bounded in the region ai,a 2 > —1/2, say. Seeing how the product G q (si, s 2 ) 
runs over primes p £ J 7 , the singular series Sb( 2 )(K) appears in place of the usual &{%). When 
q > 1, we proceed as in [13] and [7], where again, the only difference will be the apparition of the 
new singular series. 

The proof of proposition 12.91 is accomplished by establishing the estimates 

Wt{C):= 6 ( n + ^(n; U,l)~ 7 (C,H°)N C*}) (log i?) fc + 2i 

n<N,n+-H°GC \ / \ )• 

and 

N 



w, { c>-.= y 9( „ +M A|(„ ;W ,o<<-^^v w ( 2 ;)|iM,(.„ g i ! ) 



k+2l 



q\Pu(n) 

which are valid under the assumptions stated in Proposition 3. One can deal with W q in much the 
same way as W q . Indeed, unraveling W q , one first finds 
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ne f2([di,d 2 ,g]) m<fc+l m>fc+l 



■O ( OT ™ n([d,e,?])^'(Af(2)[d 1 ,da,g];JV) 



where the symbol f2° is associated to the set W°. After expanding W q , the accumulated error terms 
contribute no more than 

( \ 1/2 ✓ \ 1/2 

£ n(d)A I £ S'(d;JV) 
d<_R 2 + a J \d<B 2 + s 



2 



(log TV)- 

TV 



Kd<ATl/2~ 

TV 



(logiV) 5 

Dealing with the main term as in the case of W q , one arrives at an integral I q , of the form 
T 1 f f h< \ ( C(^i + ^2 + l) \ k ~ m R q (si,s 2 ) 

where this time 

where m = 1 if fro € "H and m = if /lo ^ "H. For g = 1 one finds 

p-i 



G(0,0) = 6 8(2) (H°) n 

pg.f ^ 

which accounts for the factor 7(£>(z). H ) in Proposition ^. 91 

5. Completing the proofs of the main theorems 
First let us recall that the singular series associated to % := {hi, hk} is 



ec<*> : =nMy 1-3^ n f'-i 



To finish the proofs of Theorem 12.11 and Theorem 12.41 we require one last ingredient, which is a 
modified version of a result due to Gallagher. 
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Proposition 5.1. Let {A\A',y) be a type A' set for some locally well- distributed sety(Q,X) and 
let B{z) be an approximate type B set. Then 

E e B{z) (H°)^P(F)h £ e B(z) (H)^P(T)e- k2 h k+1 

=1 

and similarly 



\n°\=k+iMehk+i \H\=k,Het)k+i 

hi<h, distinct hi<h, distinct 



E GA(n)^h J2 &A{n)^> g ^e- k2 h k +\ 



|W|=fe+l,W°6h fc +i \H\=k,Het) k+ i 
hi<h, distinct hi<h, distinct 

where P(P) = U. pe:F p/(p- 1). 

Proof. We can simultaneously deal with the first inequalities in each line. Observe that 

-(fc+i) / r> (vow / i \ -(fc+i) 



^*°>=n(i-T n('-— H 1 - 1 



1 — ' ° / l\-( fe +!) 

per v r 7 v <t? v P J \ PJ 

For primes p satisfying £l p (H)° = VL p (H) we get the correct lower bound immediately. When 
QpiW) < p — 2 one has 



1 top(n) i\ = 



1 A_i 



p" 5 



P / \ Py 

so that the desired bound follows easily from our choice of f)fc+i- For the second estimate involving 
B(z), we follow the argument put forth in [3] and write 

6 BW (*)> (n^ i )n(i-^)"(i-^P)>>^ e 

where a q (^-C) is multiplicative and defined as 



(,,-p(fe)=i) 



, MAX p fc -^p(H) P fc ~ 1 -b-l) A 

a p (H) = a(p,T p {H)) = — 



(p-l)» 
for primes p. 

Restricting the series in (fl4)) to values of q < x, it is demonstrated in [3] that, for any e > 

b b 
E 0,(70= E a 9 (^)+0((^) £ A), 

(g,P(fc)=l) (g,p(fc) = l) 

with the implied constant depending only on k and e. It follows that 

b 

E ©(H) = E E a q (U) + 0(h\xhy/x). 

l<hi,...,h k <h l<hi,...,hk<h 
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The inner sum can be expressed as 



(15) 



The sum 1 represents the number of fc-tuples 1 < hi,...,hk < h with % G t)k+i that occupy 
r(p) residue classes for each prime p \ q. The variable r ranges over all tuples (..., r{p), •••)pk with 
1 < r(p) < p. As in [3], we gather that (8) can be written as 



(16) 



where 



n i I - 

/ 



^)=En^.^))( r L)fe}- 

r p\ q V W/ 

The expression in curly brackets represents a Stirling number of the second kind. It is shown in [3] 
that A(q) = when q > 1, so the result follows after setting a; = /i 2e . The same argument works 
when dealing with the set A. 

□ 



To prove Theorems 12.11 and 12.11 . we recall that 

n A (2N; T]) = | {q n < 2N\ q n+1 - q n < X] log N} 

and consider the sum 



S 



1 



□ 



R,A 



,, (1<)( m- E (eA(n,h)~\og(3N))Y,^R(n;n,l), 

^ ° ' N <n<2N 



H 



where 



@c{n,h)= J2 e ( n + h o) 



ho<h 
n+h eC 

and the superscript □ indicates that the summation takes place over admissible fc-tuples H. C [l,h] 
for which (P n (n),T(R 5 )) = 1 and H E fjfc+i. In the case of a type A set, we will also assume that 
each member of % lives in y. The upper bound for Sh,a is obtained in precisely the same manner 
as [7] with the addendum that 

U'(n) < h n A (2N;ri) + 0(Nexp(-cy/fo£N)). 

N<n<2N 
Q(n,h)>3/2logN 

It follows, as in [7], that 

Sr,a « (h tt a (2N; V ) + o( 7 r(27V))) 1 / 2 

(logfl) fc + 2 'logjV (2fc + 2)!_ {2klos3N)/S}ogR 
[ ' N l /*h k [{k + l)l} 2 \S\ogR 



fc/2 



h 



S\ogR 



(fc+2)/2 
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On the other hand, combining the asymptotics with Proposition 5, one finds 

< 18 » » '-"^^'w* tr§Ti?TO + " - 1 + ° {kV) ) ■ 

In the case of a type A' set one gets a constant of the form c^{Q,X)cq instead of cr). By choosing 
k, I x \fk sufficiently large with respect to r\ (fixed), and 6 > sufficiently small it is at once clear 
that Sr > and a small calculation shows that 



tt a (2N;t 1 )^tt(N), 

as desired. 

For the proof of Theorem 12.41 we evaluate the sum 



1 



h{\ogR) k 



E ( Q V(*) ( n > h ) - lo g( 3Ar )) E A «( n; W ' o. 



N<n<2N H 

which leads to the same upper bound as (|17|) . The lower bound is obtained as before and this time 
the bracketed expression in (fl~8f takes on the form 

2k 2Z + llog.R 



7 (B(*))r 



P(J-) 7 '(6(z)) ?7 - 7 (S(z)) + (fc 3 ( 5 2 ) 



fc + 2/ + l Z + l logiV 

Since P(J r )j'(B(z))/ r y(B(z)) is bounded away from zero independently of k and z, this tells us that 

n B ( z )(2N;r))^7r(N), 
where the implicit constant will depend on k and z. We choose 



, fc = 2(f + l)(2i + l), *=i, i? - JV 1/4(1+4) 



and noting that 7(6(2;)) S> l/(logz) fc , we now let z := cxp(fc 4 ). One finds, as in [7], that 

tt b{z) (N: V ) > exp(-cfc 3 )7r(iV). 

To finish the proof of Theorem 12.41 set T> := B(z) n S c and apply the Brun-Titchmarsh Theorem 
(see for example [lOl Theorem 6.6]) to obtain 



tt-d(N; rf) < tt-d(N) < 



1 



<p(q) 



ir(N) < 



tt(A^) 



/ 



6. Some examples 

6.1. Arithmetic progressions. As a first example, we consider arithmetic progressions {n E N | n = a mod q} 
where (a, q) = 1. It is at once clear that the required conditions for a type B set hold and hence 
we get the following corollary. 

Proposition 6.1. Let rj > be arbitrary. Then any arithmetic progression (nq + a) n6 N with 
(a, q) = 1 frequently contains -q- small prime gaps. 

Several authors have studied the case of arithmetic progressions and obtained results similar to the 
above. Firstly, Goldston, Pintz and Yildirim showed in [B] that there are small prime gaps in the 
progression {n < N \ n = a mod q} and one can even let q grow slowly with N. 
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Theorem 6.2. Lets and A be arbitrary fixed positive numbers. Letq and N be arbitrary, sufficiently 
large integers, satisfying 

q (A,e) « q « (loglogTV)- 4 , N > N (A,e), 

and let a be arbitrary with (a, q) = 1. Then there exist primes p,p' G [N/3, N] such that p' = p = 
a mod q and p' — p < e log p. 

T. Freiberg demonstrated, in [8], that it is possible to find consecutive primes in short intervals 
which are both congruent to a mod q. 

Theorem 6.3. Fix any positive number e, and fix a pair of coprime integers q > 3 and a. There 
is an absolute positive constant c such that, for all sufficiently large X, 

i > ^i-c/iogiogx 

p r <X,p r+1 -p r <£ log p r 

p r =p r -^-i=a mod q 

6.2. Shifted sets of fc-free numbers. Our second application of Theorem [23] pertains to shifted 
sets of fc-free numbers, i.e sets of the form 

8 := f]{ne N| a mod/}, 

p 

where a is any fixed integer. Again, it is easily verified that B is a type B set so we obtain Theorem 
Ol 

6.3. Bohr sets. Sequences of the type ({fl(n)}) n eN, with q a polynomial, have been the subject of 
much study. It was demonstrated by H.Weyl that they are uniformly equidistributed in the unit 
interval, provided that the leading coefficient of g is irrational. Later on, I.M. Vinogradov showed 
that the sequence remains equidistributed if one restricts n to prime values. In this section we will 
add yet another result to this subject by proving Theorem 11.31 Let g(x) = Y^f=i a j x ^ e R[x}. A 
Bohr set is a collection of the form ^:={neN {fl( rl )} € [0, d] }. 

Definition 6.4. An irrational number a is of type p > if 

p = sup < 7 <E K | liminf m 7 1 1 -met 1 1 = > . 

Here \\x\\ denotes the distance from x to the nearest integer. A number a which obeys such a bound 
is said to be Diophantine. 

Remark 6.5. Observe that Theorem \1.3\ holds for Lebesgue- almost all D-tuples (ao, ...a_o) G ^ D 
and as a consequence of the Thue-Siegel-Roth Theorem (see [2]j, whenever the aj are algebraic 
irrationals. 

We would like to show that A is a type A& set (for large k ) but there are some immediate algebraic 
obstructions that must be overcome. Consider, for example, the polynomial g(x) — \f2x 2 . Observe 
that the events n € A, n + 1 € A,n + 2 € A and n + 3 G A are not independent, since 

|\/2n 2 | - 3 |V2(n + 1) 2 | + 3 |v^(n + 2) 2 | - jv^ri + 3) 2 | = mod 1. 

In other words, (0) does not even hold for A-tuples. 
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6.3.1. Some background information and implements. We begin by recalling some useful facts from 
the theory of Diophantine approximation (which can be found in 18 ). For the remainder of this 
section we will assume a is Diophantine of type p. Given any positive integer M, the collection 
{m < M | {ma} £ [0,d] } has a very neat combinatorial structure. In order to describe this struc- 
ture, rearrange the natural numbers up to M in such a way that {sia} < {s2«} < ■■■ < {smc*}. 
One has the recurrence relationship 

Sj + si when sj < M — si 

Sj + i = ^ Sj + si — sm when M — Si < Sj < sm 
Sj — sm when sm < Sj. 

Without loss of generality we may assume ||sia|| < ||sA/a|| so that ||smo:||, ||(si — SA/)a|| > 1/4M. 
By Dirichlet's theorem we have that ||sicc|| < 1/M and since a is Diophantine, we easily find that 
si ^$> e M x l p ~ E for any small e > 0. It also follows easily from the above that 

(19) |&<M| ||HI < jj^I+^j C{6<M| 6=0modsi} 

for any c > (and M sufficiently large). Another important notion related to our problem is that 
of discrepancy. Given a sequence x = (x\, xn) in [0, 1] and real numbers < <5 < j3 < 1, write 

A([5,p)) = \{n<N\x n €[5,l3)}\- 
The discrepancy of x is defined to be 



Dn(lu) — sup 

0<<5<^<l 



N 



(13 -S) 



Remark 6.6. Instead of working with the discontinuous characteristic function X[o,d] we shall 
sometimes make use of a smooth cut-off function ip(x) : [0, 1] —> [0, 1] satisfying 



ip(x) 



1 when 5 < x < d — S 
when x £ [0, d}. 

Employing such a bump function will improve the rate of convergence of if) 's Fourier expansion. 
More precisely, if ip is r -times continuously differentiable with ^(O) = ip l (d) = for i < k, one 
easily shows that the Fourier coefficients cu grow like 

1 N rC 

Cfc <C min 



X k r 

provided that we choose 6 := N~ c . In other words, for such a choice of 8 we get rapid convergence 
as soon as k 3> N c . 

The following results will aid us in the verification of conditions (a) — (c). For more details, we refer 
the reader to [11], [12], [9] and [15] respectively. 

Theorem 6.7 (Erdos-Turan-Koksma) . Let x = (x±, xm) be a sequence of real numbers in the 
unit interval [0, 1]. Then for any H e N the discrepancy of this sequence is bounded by 



1 1 1 M 

r>l 



M 

II 
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Theorem 6.8 (Weyl's inequality). Let f(x) £ K[x] be a polynomial of degree k with leading coeffi- 
cient a satisfying \a — r jv\ < 1/v 2 for some pair of coprime integers r, v with v > 0. Then 



E e (/(»)) 



1<N 



< N 



1+e 



1 

N 



-V 

N k J 



for any e > 0. 



Theorem 6.9. Let / G be a polynomial of degree k, with leading coefficient a and suppose 
\a — a/v\ < 1/v 2 with (a, v) = 1 and v < N . Then for any e > 0, 



A' 



£A(n)e(/(n)) 



< AT 



1 



ATI/2 



1/ 

iV 1 



In addition we will use a special case of van der Corput's lemma. 

Lemma 6.10 (van der Corput, special case). Let <j){x) £ C 1 ([a, b}) such that \4>'(x)\ > 1 and <j>'(x) 
is monotonic in (a, b). Then 

" e(<f>{x)) =0(1), 

with the implied constant being independent of a and b. 

Lemma 6.11. For a Diophantine and e > fixed, one has the inequality 

\B a {M,M k+c ,m)\ = \{b< M\ \\mb k a\\ < 1/M k+C } \ < fe , £ M 1-0 ^ 
for any C > and uniformly in the range 1 < m < M s . 

Proof. Let C N denote the set of all integers of the form mn k and write H :— \N C ~\ . According 
to (|19[) there exists an integer Si < M k H such that 

\B a (M, M k+C ,h)\ < \{lsi\ I < M k H/ Sl }Dp k h \. 

Since a is Diophantine, we have a lower bound of the type si > (MH) c ( p \ Combining this fact 
with the above estimate, the result easily follows. □ 

6.3.2. Verifying conditions (a) — (c) for Bohr sets. To begin with, we will need to choose a collection 
t) of k- tuples % = (hi, hk) in such a way that the problem arising in Remark 6.6 can be avoided. 
Consider the first k primes pi, ...,pk and for each pi, let e, be the smallest natural number for which 

< d. We will say "He I) if for each pi, one has that p?* \hj Vj ^ i and (pi, hi) = 1. 
In order to demonstrate a bound of the type stated in condition @, we will consider sums of the 
form 

E I- 

n<N,n=a mod g 

with each „4j C ^4. The sets Ai will vary for each N but we will ensure that \A% \ ^>fe independently 
of TV. Under these circumstances all of the arguments leading to the proofs of our main theorems 
go through. 

Finally, we observe that in the case of a linear Bohr set *4(TV, d) :— {n < N \ {not} £ [0, d] } we 
have the bonus of an additive structure, since A(N, d/2) + A(N, d/2) C A(N, d). 
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Proposition 6.12. Subject to the constraints described above, one has an inequality of the form 

Y,\K A (N,a,q)\<N 1 - c ^ 
q<Q 

for Q = N 1 ' 2 and some constant c > depending on p. 

Proof. Let e > be a small but fixed constant to be chosen later. First we observe that the sequence 
(fl( n ))nGN is uniformly equidistributed mod 1. As a result, there exists an interval / C [d/2 7 d] of 
length |/| < d/3N £ containing no more than 7V 1_e elements of the form g(n) with n < N. We may 
now replace d with the right end-point of I, since reducing d by a factor 2 will have no bearing 
on the statement of Theorem 13. Moreover, for each individual i < k we shorten the interval J, 
obtained above, at its left-end and modify the constant term in g, so that we may assume d = p~ e ' . 
In other words, for each i < k we will work on a shortened interval U := [0,^]. Letting At denote 
the Bohr set associated to Ii, we note that 

fe fc k / k \ 

]J U(n + hi) = Jl 1[oa] (s( n + h i)) = II ^ n + hi ^> + I X + h i>) ) ' 

z— 1 i—l i—l \r— 1 / 

where <5 = N~ e and ip is the truncated Fourier series, as described in Remark l6.6l To deal with the 
remainder term we invoke the Erdos-Turan-Koksma inequality (setting H = N e ) and gather that 



(20) 



q<Q 



hi)mi 



q<Q |m|<7V E n<N/q 
m^O 



q<Q r=l 



^ e (0(nq + a + /j,)r) 

n<JV/g 



0(7V 



l-e\ 



where the subscript m runs over fc-tuples (mi, m^), the Fourier coefficients obey the bound 



n 



|l/mi| and |m| denotes the maximum norm of m. Moreover, when 
p? 4 \mi for all component mi of m, we may assume that ^ 0. To see this, observe that when 

p\ % \mi, the Fourier coefficient c m < 1/N E and the number of fc-tuples m for which = is at 

most 7V(fc-i)e. 



Due to our choice of H, we now see that the polynomial P{x) := ^i=i 9( x 1 + a + ^i) m « appearing 
in (|2"P|) does not collapse to a constant function and in fact has a leading coefficient of the form 
(3(q, m) := ('°)<Z' D_J 'a!A ; _j(^*L 1 h^rrii) for j = or 1. We will deal with the first sum in (|2"0j) and 
note that the second quantity is treated in very much the same way. As a consequence of Weyl's 
inequality, we have 



< 



N 



<1 

D-e 



l-c( E ) 



%<N/q 

as long as one can find natural numbers N e < v < N u ~ e and r > 1 coprime to v, satisfying 

r 1 
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First consider those "good" pairs (q, m) for which f3(q, m) admits a rational approximation as 
above. Applying Weyl's inequality to these pairs, it follows that their contribution to the RHS of 
(|20j) is easily subsumed in the desired bound. Also, since a is Diophantine, we can choose e in such 
a way that (q, m) is good whenever q D < N £ . 

Let us arrange the remaining "bad" pairs (q, m) into dyadic cubes Q < q < 2Q and H < |m| < 2H. 
By Dirichlet's theorem, for each such bad pair, we thus find a v < N c{£ V 2 such that |/3 - r/v\ < 
l/vN D ~ £ . In other words, we can write 



u 



i,<ATc(e)/2 



q< C| ||q D |m|a|| < 



N D ~ 



From the previous Lemma we gather that each set in the union above has size O {Q 1 c ' p ^) (for e 
sufficiently small) so that 



E E * 

Q<q<2Q H<\m\<2H 
(q,m) bad 



E e ( p (")) 

l<N/q 



«(logA^)Q 1 - c W^. 



The proof is completed by summing over all dyadic cubes for which Q > N £ / D . 

When D = 1, the proof proceeds in the same manner except it is no longer necessary to sum over 

fc-tuples m in equation (EU1) . □ 



Lemma 6.13. For Q = {YogN) A , we have 



E^y E 

q<Q ^ W X mod q 



N 



(log N) A 



Proof. Let e > be fixed and expand tp' A (N,x) using the smooth cut-off function discussed in the 
proof of Proposition 16.121 For x Xo this yields 



|m|<Af e n<N \i=l / 

= E ^EE^ A (") e fE0(" + ^ + -)+o(^-) 

m|<AfE T ^'' a=l n<N \i=l 9 / 

where r(x) = Sm=i x( TO ) e ( m /?)j c m *C l/|m| and iJ -C N £ . First consider the case D = 1. Since 
a is Diophantine, we may estimate the exponential sum immediately, using Theorem 16.91 to obtain 
the desired bound. 

When D > 2, we adopt the exact same strategy as in the proof of Proposition 16.121 Organise m 
into dyadic cubes and then consider separately the good and bad pairs (g, m). 
Finally, when m = 0, we have the classical estimate 

E *w A (") « (i^p- 

n<iV v ° ' 

The case x = Xo is treated similarly. □ 
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Our final task is to prove the bilinear form estimate (JSJ) - We recall the following variant of the 
classical large sieve inequality 

(21) 



E— — y^max 
ifi(q) ^ u 

q<Q ^ W X 



Lemma 6.14. 



i<M KL 
mn<u 



« (M + QY /2 (L + Q 2 ) 1/2 ( E l« m | 2 ) 1/2 (El 6 '| 2 ) 1/2l °g( 2Mi )- 



KL 



(22) 



Y—Y 



Y Y ambix ^ m ^ 



m<M KL 
ml+HCA 



« (M + Q 2 ) l ' 2 (L + Q 2 ) 1 ^ 2 (log ML) 3 ||o|| a ||b|| 



Proof. We will assume without loss of generality that L > M. The inner-most double sum in (|2"2"|) 
can be expanded to obtain 



S := Y E Qmfyx(raZ) = Y Y amblX ( m ^ E Cr e IY 9( ml + h i) r i ) 

m<M KL m<M KL |r|<L c i \i=l / 



0(1/ L 2 ) 



ml+HCA 



(23) = E Cr E ^2 a mhx(ml) 

|r|<L c i m<M KL 



F T (ml) 



{ML) 2 



e(x) dx + e((ML) 2 



+ 0(l/L 2 ). 



Here F T (x) = J2i=i9( x + hi) T i an d we shift the leading coefficient by a large integer multiple 
of 27tz to ensure that F r (y) > (AIL) 2 for y > 1. Furthermore, the constant C\ depends only 
on rj, r = (n, rjf) € and regarding the Fourier coefficients, we have c r <C IIr-/o I^A*!' 
Interchanging the order of summation and integration in (|23[) leads to the quantity 

L C 2 (v,K) 

(24) / e(x) E Y a ^ h ^ ml ) 

J (ML) 2 m <Ml<L 
F r (ml)>x 

The condition F T (ml) > x can be replaced by an expression of the form ml € Ui[Zi(F T — x), Zi + \(F r — 
x)\ where Zi(G) denotes the i-th real zero of the function G. It therefore suffices to evaluate (f2"4"|) 
under the new condition Zi(F T — x) < ml or rather H(x) :— \og(Zi(F r — x)) < log(mZ). A simple 
application of the Implicit Function Theorem shows that H'(x) <C 1/ar, with the implied constant 
being independent of the modifications made to F r . Substituting the discontinuous integral (p. 165 
of CP) 



e(at) sin(27r/3<) — 

' t 



l + OCT^CS-lal)- 1 ) if|a|</3 
OtT-^-M)- 1 ) if|a|>/3 



into (|2U)) . we are left with sums of the form 



sin(27r[log(m) + log(f)]t) 



/ e(x + tH(x)) 2 / dmbiximl) dx 

J (ml) 2 Jr^{, 77i 



dt 



m<M KL 



Choosing T — (ML) 3 / 2 , we see that 4-(x + tH(x)) is bounded away from zero and piecewise 
monotonic so that van der Corput's lemma can be applied. Note that, after applying a simple trig 
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identity, we have effectively separated the variables m and I so that the resulting quantities may be 
absorbed into a m and bi. We may now proceed to use (|2Tj) . □ 

Acknowledgements I would like to thank my advisor, Prof. Terence Tao, for his generous en- 
couragement and numerous suggestions throughout this project. 

References 

[1] H. Davenport, Multiplicative Number Theory, third ed., Grad. Texts in Math., vol 74, Springer- Verlag, 2000, 

Revised by H. L. Montgomery. 
[2] H. Davenport, K. Roth, RationaJ approximations to algebraic numbers, Mathematika 2 (1955) 160-167. 
[3] P.X.Gallagher, On the distribution of primes in short intervals, Mathematika 23 (1976), 4-9. 
[4] D. Goldston, J. Pintz and C. Yildirim, Primes in tupies I, Ann. of Math. 170, (2009), 819-862. 
[5] D. Goldston, J. Pintz and C. Yildirim, Primes in tuples II, Acta Math 204, (2010), 1-47. 

[6] D. Goldston, J. Pintz and C. Yildirim, Primes in tuples III: On the difference p n +is — p-n, Functiones et Approx- 
imate. 35, (2006), 79-89. 

[7] D. Goldston, J. Pintz and C. Yildirim, Primes in tuples IV: Density of small gaps between consecutive primes, 
larXiv:1103!5886l l [math.NT], (2011) 

[8] T. Freiberg, Strings of congruent primes in short intervals II. [arXlyTTl lO.6624 [math.NT], (2011). 

[9] G. Harman, Trigonometric sums over primes I, Mathematika, 28, (1981), 249-254. 
[10] H. Iwaniec, E. Kowalski, Analytic Number Theory, Colloq. Publications vol. 53, Amer. Math. Soc, (2004). 
[11] L.Kuipers, H.Niederreiter, Uniform distribution of sequences, Dover Publications (2005) 

[12] M. B.Nathanson, Additive Number Theory: The Classical Bases, Grad. Texts in Math, vol 164, Springer- Verlag, 
(1996). 

[13] J. Pintz, Are there arbitrarily long arithmetic progressions of twin primes? , An irregular mind, Szemeredi is 
70, (Editors: I. Barany and J. Solymosi), 525-559, Bolyai Soc. Math. Studies 21, Springer, (2010). 

[14] K. Soundararajan Small gaps between prime numbers: The work of Goldston-Pintz-Yildirim, Bull, of the Am. 
Math. Soc., 44, (2007), 1-18. 

[15] E. Stein, Harmonic Analysis: Real-variable Methods, Orthogonality and Oscillatory Integrals, Princeton Uni- 
versity Press, (1993). 

[16] T.Tao, V.Vu, Additive combinatorics, Cambridge Studies in Advanced Mathematics 105, (2006) 
[17] G.Tenenbaum, Introduction to Analytic and Probabilistic Number Theory, Cambridge studies in advanced 
mathematics 46, (1995) 

[18] T. van Ravenstein, On the discrepancy of the sequence formed from multiples of an irrational number, Bull. 
Austral. Math. Soc, 31, (1985), 329-338. 

UCLA Department of Mathematics, Los Angeles, CA 90095-1555. 
E-mail address: jbenatarOmath.ucla.edu 



